Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation

Authors

  • Hideki Kawahara
  • Hisami Matsui
Abstract

An elastic spectral distance measure is introduced to provide a basis for auditory morphing. The measure is based on F0-adaptive pitch-synchronous spectral estimation and selective elimination of periodicity interference, which were developed for the high-quality speech modification procedure STRAIGHT [1]. The proposed measure is implemented on a low-dimensional piecewise bilinear time-frequency mapping between the target and the original speech representations. Preliminary test results of morphing emotional speech samples indicated that the proposed procedure provides perceptually monotonic, high-quality interpolation and extrapolation of CD-quality speech samples.
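The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of morphing between two smooth time-frequency representations. It is not the STRAIGHT procedure or the authors' distance measure. It assumes that both spectrograms are strictly positive arrays with time on axis 0, that corresponding anchor points have already been chosen by hand, and that the time mapping is a simple piecewise-linear warp. The function names `warp_axis`, `resample_frames`, and `morph_log_spectra` are hypothetical, as is the blending parameter `rate`.

```python
import numpy as np


def warp_axis(anchors_src, anchors_dst, grid):
    """Piecewise-linear map that sends anchors_src to anchors_dst, evaluated on grid.

    anchors_src must be increasing (a requirement of np.interp).
    """
    return np.interp(grid, anchors_src, anchors_dst)


def resample_frames(spec, positions):
    """Linearly interpolate spectrogram frames (axis 0 = time) at fractional positions."""
    positions = np.clip(np.asarray(positions, dtype=float), 0, spec.shape[0] - 1)
    lo = np.floor(positions).astype(int)
    hi = np.minimum(lo + 1, spec.shape[0] - 1)
    frac = (positions - lo)[:, None]
    return (1.0 - frac) * spec[lo] + frac * spec[hi]


def morph_log_spectra(spec_a, spec_b, rate):
    """Blend two time-aligned, strictly positive spectrograms in the log domain.

    rate = 0 returns spec_a and rate = 1 returns spec_b.
    Values of rate outside [0, 1] extrapolate beyond either example.
    """
    log_a = np.log(spec_a)
    log_b = np.log(spec_b)
    return np.exp((1.0 - rate) * log_a + rate * log_b)
```

Under these assumptions, one would first use `warp_axis` to map frame indices of one utterance onto the other, call `resample_frames` to align the second spectrogram, and then call `morph_log_spectra` with the desired `rate`. Blending in the log domain is a design choice of this sketch, chosen because it keeps the result positive under extrapolation.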

Similar resources

Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing

This article presents tutorial information about STRAIGHT and TANDEM-STRAIGHT, a widely used speech modification tool and its successor, as well as their application to speech morphing. They share the same concept: that the periodic excitation found in voiced sounds is an efficient mechanism for transmitting the underlying smooth time-frequency representation. They are also based on perceptual equivalence...

Modeling phonological recognition of Persian words

A model of spoken word recognition is proposed. This model is particularly concerned with the extraction of cues from the signal, leading to a specification of a word in terms of bundles of distinctive features, which are assumed to be the building blocks of words. In the proposed model, auditory input is chunked into a set of successive time slices. It is assumed that the derivation of the underly...

Exemplar-based Voice Quality Analysis and Control using a High Quality Auditory Morphing Procedure based on STRAIGHT

This paper introduces a new strategy and tools for voice quality research that complement conventional approaches. A very high-quality speech analysis, modification and synthesis procedure, STRAIGHT, which is basically a channel VOCODER based on a pitch-synchronous analysis-synthesis framework, was extended to implement auditory morphing in terms of spectral, pitch and voice quality par...

Performance of the Wavelet Transform-Neural Network Based Receiver for DPIM in Diffuse Indoor Optical Wireless Links in Presence of Artificial Light Interference

Artificial neural networks (ANNs) have applications in diverse areas of communication engineering, such as channel equalization, channel modeling, and error control coding, because of their capability for nonlinear processing, adaptability, and parallel processing. On the other hand, the wavelet transform (WT), with both time and frequency resolution, provides the exact representation of a signal in both doma...

Publication date: 2003